MORE ECONOMICAL RESOURCE APPLICATION ON THE USER
INTERACTION WITH A SPEECH DIALOGUE SYSTEM IN A PACKET 
NETWORK BY MEANS OF A SIMPLIFYING PROCESSING OF SIGNALLING 
INFORMATION 

CROSS REFERENCE TO RELATED APPLICATION 
[0001] This application is the US National Stage of International Application No.
PCT/EP2004/051128, filed June 16, 2004, and claims the benefit thereof. The
International Application claims the benefit of German Patent Application No. 103 27
290.9 DE, filed June 17, 2003. Both applications are incorporated by reference
herein in their entirety.

FIELD OF THE INVENTION 
[0002] The invention relates to methods and devices for simplifying the processing of
signaling information during a dialogue with a speech dialogue system in a packet
network.

BACKGROUND OF THE INVENTION 
[0003] One of the most important current developments affecting the fields of network
technologies, call processing, and Internet technologies is the realization of services
with real-time transmission via packet networks.

[0004] At present, most speech transmission is handled via circuit-switched
networks, also known as TDM (time division multiplexing) networks. The aim for the
future is to transmit a greater amount of speech via packet-oriented networks, which are
currently used mainly for data transmission. Here, the so-called IP (Internet Protocol)
networks are the most important class of packet networks. In addition, there will in future
be further real-time services that demand high transmission capacity, such as, for example,
the transmission of video data during a video-on-demand service.

[0005] An important class of real-time services is the automated provision of speech
or video information. One example of this type of service is given by the recorded
announcement services known from TDM networks, e.g. telephone number
announcements ("the telephone number of the subscriber is ...") or error messages ("the
number you are trying to call is not available"). Automated information output
can also contain subscriber-specific information (e.g. telephone numbers). Dialogue
functionality is an extension of the announcement functionality: the user can control
the service or the dialogue by using the keys on his terminal device or by means of speech
input. Servers are used to provide such services in packet networks. In the case of
interactive services, the term IVR (interactive voice response) server is commonly used.
A number of coding methods or codecs (coder-decoder), such as, for example, G.711 A-law/µ-law,
G.723.1, G.726, G.728 and G.729A/B, have been standardized for the transmission of speech.
Standards H.261 and H.263, for example, are used for the transmission of video
information. For an information output, a codec or coding method that is
supported by both ends of the network is usually selected for the information transmission
in a so-called codec negotiation.

[0006] For services with real-time transmission via data networks, it is essential that
the service characteristics known from the TDM network be provided for corresponding
or new services with comparable quality and efficiency. The optimization of the resource
application plays an important part in this.

SUMMARY OF THE INVENTION 
[0007] The object of the invention is to improve the efficiency of the resource
application in automated information output.

[0008] The invention is based on the following consideration. The signaling for
an interactive dialogue with a speech dialogue system or an IVR (Interactive
Voice Response) server, e.g. for an output of information, is usually carried out using
DTMF signals (DTMF: dual-tone multi-frequency). With this signaling, also
frequently called tone dialing, an interaction between the
subscriber and the speech dialogue system is realized by exchanging information
coded as frequencies. Three scenarios can be distinguished in the transmission
of DTMF signals via a packet network:

[0009] The DTMF signals are contained in the payload stream. This is also
referred to as in-band transmission. In-band transmission is only used in
conjunction with non-compressing coding methods or codecs such as, for example, G.711.

[0010] DTMF signals and payload are transmitted separately, i.e. out-of-band
transmission of DTMF signaling information is carried out.

[0011] The DTMF signals are transmitted in the payload stream in separately labeled
data packets. A transmission of this type was standardized by the IETF in the Request for
Comments RFC 2833 for the RTP (Real-time Transport Protocol) packet format.
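
As an illustration of the third scenario, the following minimal Python sketch decodes the four-byte telephone-event payload that RFC 2833 defines for separately labeled DTMF packets (event code, end bit, volume, duration). The names TelephoneEvent and parse_telephone_event are hypothetical and chosen only for this example; the sketch assumes the RTP header has already been stripped.

```python
import struct
from dataclasses import dataclass

# RFC 2833 assigns event codes 0-15 to the DTMF keys in this order.
DTMF_DIGITS = "0123456789*#ABCD"


@dataclass(frozen=True)
class TelephoneEvent:
    digit: str      # DTMF key that was pressed
    end: bool       # E bit: set on the last packet of an event
    volume: int     # tone power level, 0..63 (in -dBm0)
    duration: int   # event duration in RTP timestamp units


def parse_telephone_event(payload: bytes) -> TelephoneEvent:
    """Decode the 4-byte telephone-event payload of one RTP packet."""
    if len(payload) < 4:
        raise ValueError("telephone-event payload needs 4 bytes")
    # Layout: event (8 bits) | E, R, volume (1+1+6 bits) | duration (16 bits)
    event, flags, duration = struct.unpack("!BBH", payload[:4])
    if event >= len(DTMF_DIGITS):
        raise ValueError(f"event {event} is not a DTMF digit")
    return TelephoneEvent(
        digit=DTMF_DIGITS[event],
        end=bool(flags & 0x80),
        volume=flags & 0x3F,
        duration=duration,
    )
```

Because the digit arrives as a small integer field, a receiver of this kind needs no tone detection on the audio stream, which is why no DSP or ASIC resources are required for it.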

[0012] With in-band transmission of DTMF signals, special hardware
resources, for example implemented with DSPs (DSP: digital signal processor) or ASICs
(ASIC: application-specific integrated circuit), are usually required in the speech dialogue
system or the IVR server for the analysis of the DTMF signals. According to the invention,
in-band transmission of DTMF signals is largely avoided, and the use of speech dialogue
systems or IVR servers without hardware resources for the recognition of DTMF signaling
is proposed.

[0013] The coding method and the type of exchange of DTMF signals for an
automated information output are usually determined during a so-called codec negotiation
between packet network terminals. The first packet network terminal is represented, for
example, by a network interface device or a media gateway or by a packet-based terminal
linked directly to the packet network. The second packet network terminal is the speech
dialogue system. In the codec negotiation, a codec supported by both terminals and by the
network is selected from a list of codecs. Usually, when a codec is selected, the type of
transmission of the DTMF signals is also determined by default or by presetting, e.g. the
selection of the coding method G.711 is linked to in-band transmission of the DTMF
signals. According to the invention, two methods are presented for excluding in-band
transmission:

[0014] In the first method, only out-of-band signaling or signaling by means of
specially labeled data packets is permitted in the codec negotiation. Coding methods that
involve in-band signaling of DTMF signals are in effect eliminated from the list of
applicable codecs during the codec negotiation.
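
The first method can be pictured as a filter over the offered codec list. The sketch below is illustrative only: the set IN_BAND_DTMF_CODECS and the function exclude_in_band are hypothetical names, and the classification of G.711 as the in-band codec follows the example given in the text.

```python
# Assumed classification: codecs whose selection implies in-band DTMF by default.
IN_BAND_DTMF_CODECS = frozenset({"G.711A", "G.711u"})


def exclude_in_band(offered):
    """Drop codecs tied to in-band DTMF while keeping the priority order."""
    return [codec for codec in offered if codec not in IN_BAND_DTMF_CODECS]
```

An empty result would mean that no codec with out-of-band signaling is left to negotiate.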

[0015] The second method includes an extension of the logic of the control device
controlling the speech dialogue system. The control device (e.g. a packet-based exchange,
a call server, a proxy server or a soft switch) is embodied such that it
signals to the remote packet network terminal involved in the codec negotiation to use
out-of-band signaling of DTMF signals independently of the selected codec. In this case, as a
rule, the codec G.711 can also be selected.

[0016] The invention has the advantage that speech dialogue systems can be used
without complex hardware resources. In principle, for services with user interaction or
automated information output, it is then also possible to use so-called general-purpose
platforms, that is, multi-function computers with open interfaces that provide the desired
IVR or information output resources through their software tools. The provision of
dedicated hardware is no longer necessary.

[0017] According to a further embodiment, as a backup so to speak, provision is
made for rerouting the job to a speech dialogue system with dedicated hardware for
those cases where the in-band signaling of DTMF signals cannot be avoided with the
above procedure. That would be the case in the first method if no codec with out-of-band
signaling that is supported by the two packet network terminals can be identified and there is
also no provision for signaling by means of a packet specifically provided for DTMF
signals. In this case, according to the further embodiment, the service is switched to a
speech dialogue system with dedicated hardware. With the second method, in the course
of which a control device stipulates out-of-band signaling for the first packet network
terminal independently of the codec selected, it is possible to forward the call to the
speech dialogue system with hardware for DTMF signal recognition if the necessary
resources or technical support for the out-of-band transmission are not available.

[0018] The further embodiment allows a service to be handled even in cases where it
cannot be carried out with the speech dialogue system without special
hardware. Otherwise, service requests of that type would have to be refused. As a
rule, however, it will be possible to provide the service through the speech dialogue
system without special hardware. Therefore, the provision of one backup speech dialogue
system will be sufficient for a large number of speech dialogue systems without special
hardware. Alternatively, the speech dialogue system with special hardware can be
assigned to only one speech dialogue system without dedicated hardware, whereby the
speech dialogue system without special hardware is accordingly more powerfully sized,
i.e. with respect to the available resources it is designed to process many more
service requests per time unit than the other speech dialogue system.

BRIEF DESCRIPTION OF THE DRAWING 
[0019] In the following, the subject of the invention is described in more detail
with reference to an embodiment shown in a figure.

[0020] Figure 1 is an illustration of the preferred embodiment of the
invention.

DETAILED DESCRIPTION OF THE INVENTION 
[0021] Two IVR servers, IVR1 and IVR2, are represented in the figure. The
first IVR server IVR1 has no special hardware for the processing of DTMF signals; the
second IVR server IVR2 does. Subscribers TLN are represented that are connected to
a TDM network PSTN/ISDN. Voice communications of the subscriber TLN can be
switched via a switching system switch. The TDM network PSTN/ISDN is connected
to an IP network IPNet by means of a media gateway GW. Here, this is, for example, a
core network. Within the IP network IPNet, signaling information and payload ND are
routed separately. Signaling information coming from the TDM network PSTN/ISDN is
transferred via a so-called signaling transfer point STP to a call server CS2. Signaling is
carried out with the help of signals of the SS7 signaling system. Using the MGCP protocol
(Media Gateway Control Protocol), the call server CS2 exchanges signaling information
with the gateway GW and the speech dialogue systems IVR1 and IVR2. The H.248
protocol could also be used instead of the MGCP protocol. With the help of the MGCP
protocol, the call server CS1 can also control the
speech dialogue system IVR1.

[0022] According to the invention, the selection of a codec is carried out in the course
of an interactive dialogue with a subscriber TLN as follows:

[0023] In the codec negotiation, a type of DTMF signaling without in-band
signaling is required through the selection of the coding method; this DTMF
signaling takes place in the packet network by means of signaling between the IP terminals and
the call server CS2. The procedure runs as follows: the A-side IP terminal,
represented by the gateway GW, signals a prioritized list of voice codecs, fax and tone
capabilities to the call server CS2 during the call set-up. In compliance with administrable
defaults, the call server CS2 can delete from the list the voice codecs/capabilities that
should not be used in the network, or it can change the priorities. The
modified list is delivered to the B-side IP terminal, in this case the speech dialogue system
IVR1. This system compares the list received via the signaling with its own list and
eliminates the voice codecs/capabilities that are not contained in both lists. The list thus
checked and possibly modified is signaled back to the A-side IP terminal via the call
server CS2 and determines the selection of voice codecs/capabilities that are to be used.
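
The three stages of this negotiation (offer by the A side, modification by the call server, comparison by the B side) can be sketched as plain list operations. The function names, the capability label "telephone-event" and the example codec lists are assumptions made for illustration; the actual exchange is carried in the signaling protocol and is not modeled here.

```python
def call_server_filter(offer, banned=frozenset(), preferred=frozenset()):
    """Call server step: delete capabilities that must not be used in the
    network, then move preferred ones to the front of the priority list."""
    allowed = [cap for cap in offer if cap not in banned]
    # sorted() is stable, so the original order is kept within each group.
    return sorted(allowed, key=lambda cap: cap not in preferred)


def b_side_answer(received, own_capabilities):
    """B-side step: keep only capabilities contained in both lists,
    in the priority order of the received list."""
    return [cap for cap in received if cap in own_capabilities]
```

With an A-side offer of G.711A, G.726, G.729A and telephone-event, a call server that bans G.726 and prefers G.729A, and a B side that supports G.729A, G.723.1 and telephone-event, the list signaled back would contain G.729A and telephone-event.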

[0024] During the codec negotiation, the speech dialogue system IVR1 offers only
signaling according to RFC 2833 and compressing voice codecs, which do not permit in-band
DTMF signaling and therefore inevitably result in DTMF out-of-band signaling. This
means in particular that the commonly used but DTMF-transparent regression coders (e.g.
G.711) are not contained in the codec list signaled back by the IVR. The demand for
DTMF out-of-band signaling is made by the call server CS2 by means of signaling to the
A-side IP terminal. To this end, the call server CS2 has a logic that checks the voice codec
selected in the codec negotiation. If this is a compressing voice codec (e.g. G.723), the
call server signals DTMF out-of-band transmission to the A-side IP terminal.

[0025] There is a peripheral device with virtual announcement and/or dialogue ports in
the call server CS2. The speech dialogue system IVR1 and possibly also the speech
dialogue system IVR2 are controlled via said peripheral device. This peripheral device
converts the seizures of its ports by the
call server CS2 into seizure signaling of the allocated ports leading to the speech dialogue
systems. It also issues the jobs to play announcements and dialogues
to the speech dialogue systems. Acknowledgements from the speech dialogue
systems IVR1 or IVR2 indicating the end of the announcement or containing the input of
the end user arrive at the assigned peripheral device with virtual announcement and/or
dialogue port. All the signaling between the peripheral device responsible for the virtual
announcement and/or dialogue port and the assigned external speech dialogue systems
IVR1 or IVR2 takes place via the signaling protocol MGCP that is used to access the media
gateway MG.

[0026] If the A-side IP terminal supports neither RFC 2833 nor any of the voice
codecs offered by the IVR1 that lead to DTMF out-of-band signaling, the
service request is automatically rerouted or forwarded to an alternative speech
dialogue system IVR2 that also supports the voice codecs with in-band DTMF signaling.
Since, as a rule, only a small number of IP terminals exclusively support voice
codecs with in-band DTMF signaling, the number of channels of the speech dialogue system
IVR2 can be substantially smaller than that of the speech dialogue system IVR1, thus
optimizing the cost of the overall IVR functionality to be made available.

[0027] The codec negotiation that takes place during the call set-up is used as a trigger
event for the rerouting from the speech dialogue system IVR1 to the speech dialogue system
IVR2. The speech dialogue system IVR1 determines on the basis of the codec negotiation
that there is no match between the voice codecs of the A-side IP terminal and of the
speech dialogue system IVR1, and signals a corresponding error (e.g. error code 543
"Codec Negotiation Error") to the peripheral device with virtual announcement and/or
dialogue port in the call server. The peripheral device evaluates this error and, by means of
the data link control in the call server, thereupon initiates a release of the connection to
the speech dialogue system IVR1, followed by a setting up of the connection to the speech
dialogue system IVR2. The connection to the A-side IP terminal is maintained during this
rerouting procedure. The addresses of the speech dialogue systems IVR1 and IVR2 are
administered in the database of the call server CS2.
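
The rerouting sequence can be sketched as a loop that tries IVR1 first and falls back to IVR2 when no common codec is found. This is a simplified model under stated assumptions: the function names and the capability tables are hypothetical, the signaling itself is reduced to log strings, and only the error code 543 is taken from the text.

```python
CODEC_NEGOTIATION_ERROR = 543  # example error code quoted in the text


def common_codecs(a_side, ivr_codecs):
    """Codecs supported by both ends, in A-side priority order."""
    return [codec for codec in a_side if codec in ivr_codecs]


def route_dialogue(a_side, ivr_table, order=("IVR1", "IVR2")):
    """Try each speech dialogue system in turn; the A-side leg stays up.

    Returns (chosen system or None, negotiated codecs, list of steps taken).
    """
    steps = []
    for name in order:
        match = common_codecs(a_side, ivr_table[name])
        if match:
            steps.append(f"set up {name}")
            return name, match, steps
        steps.append(f"{name} reports error {CODEC_NEGOTIATION_ERROR}; release {name}")
    return None, [], steps
```

For an A-side terminal that supports only G.711A, the sketch releases IVR1 after the negotiation error and sets up IVR2, provided that IVR2 lists G.711A among its codecs.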

[0028] It is also conceivable that the speech dialogue system with special hardware
IVR2 is controlled by a call server other than the call server CS2, e.g. by the call
server CS1. In this case, the rerouting from the speech dialogue system IVR1 to the speech
dialogue system IVR2 can be achieved by exchanging appropriate signaling information
between the two call servers CS2 and CS1, e.g. by means of the BICC (Bearer Independent
Call Control) protocol.

[0029] An alternative procedure is based on the extension of the logic in the call
server CS2 that requires the DTMF out-of-band signaling. The extension consists in the logic
checking whether the B-side IP terminal is the speech dialogue system IVR1. In this case, the
speech dialogue system IVR1 also offers the non-compressing voice codecs, and the call
server CS2, independently of the selected voice codec, always signals DTMF out-of-band
transmission to the A-side IP terminal.
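
The two variants of the call server logic, the codec check and the check of the B-side terminal, can be combined in one small decision function. The sketch is an assumption-laden illustration: the names dtmf_transport and COMPRESSING_CODECS are hypothetical, and the set of compressing codecs is an assumed classification limited to two of the codecs named in the text.

```python
# Assumed classification of compressing voice codecs for this example.
COMPRESSING_CODECS = frozenset({"G.723.1", "G.729A"})


def dtmf_transport(codec, b_side, forced_b_sides=frozenset()):
    """Decide which DTMF transport the call server signals to the A side.

    forced_b_sides holds the B-side terminals for which out-of-band DTMF is
    always signaled, independently of the selected codec (extended logic).
    """
    if b_side in forced_b_sides:
        return "out-of-band"
    # Basic logic: a compressing codec cannot carry DTMF tones in-band.
    return "out-of-band" if codec in COMPRESSING_CODECS else "in-band"
```

With IVR1 entered in forced_b_sides, a call that negotiates G.711A towards IVR1 is still signaled as out-of-band, whereas the same codec towards another terminal keeps its in-band default.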